Exponentially weighted moving average charts for detecting concept drift

Authors

  • Gordon J. Ross
  • Niall M. Adams
  • Dimitris K. Tasoulis
  • David J. Hand
Abstract

Classifying streaming data requires methods that are computationally efficient and able to cope with changes in the underlying distribution of the stream, a phenomenon known in the literature as concept drift. We propose a new method for detecting concept drift which uses an Exponentially Weighted Moving Average (EWMA) chart to monitor the misclassification rate of a streaming classifier. Our approach is modular and can hence be run in parallel with any underlying classifier to provide an additional layer of concept drift detection. Moreover, our method is computationally efficient, with O(1) overhead, and works in a fully online manner with no need to store data points in memory. Unlike many existing approaches to concept drift detection, our method allows the rate of false positive detections to be controlled and kept constant over time.
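
The idea of charting a classifier's error stream with an EWMA can be illustrated with a minimal sketch. This is not the authors' exact procedure: the abstract does not give the update rules or how the control limit is chosen to hold the false positive rate constant, so the code below uses the textbook EWMA chart for a Bernoulli stream with a fixed control limit. The names `EwmaDriftDetector`, `first_alarm`, `lam`, `control_limit`, and `warmup` are all hypothetical, and the default values are arbitrary.

```python
import math


class EwmaDriftDetector:
    """Textbook EWMA chart over a stream of 0/1 classification errors.

    Illustrative sketch only: the fixed control limit and the warm-up
    period are assumptions, not taken from the paper.
    """

    def __init__(self, lam=0.2, control_limit=3.0, warmup=30):
        self.lam = lam                      # EWMA smoothing weight in (0, 1]
        self.control_limit = control_limit  # width of the limit in std. deviations
        self.warmup = warmup                # observations used only to estimate p
        self.t = 0        # total observations seen
        self.n = 0        # observations seen by the chart after warm-up
        self.p_hat = 0.0  # running estimate of the error rate
        self.z = 0.0      # EWMA statistic

    def update(self, error):
        """Consume one 0/1 error indicator; return True if the chart signals."""
        self.t += 1
        # Incremental mean: constant time and memory per observation.
        self.p_hat += (error - self.p_hat) / self.t
        if self.t <= self.warmup:
            # Start the chart at the estimated in-control error rate.
            self.z = self.p_hat
            return False
        self.n += 1
        self.z = (1.0 - self.lam) * self.z + self.lam * error
        # Standard EWMA variance factor: lam/(2-lam) * (1 - (1-lam)^(2n)).
        factor = self.lam / (2.0 - self.lam) * (1.0 - (1.0 - self.lam) ** (2 * self.n))
        bernoulli_var = max(0.0, self.p_hat * (1.0 - self.p_hat))
        sigma_z = math.sqrt(bernoulli_var * factor)
        # One-sided test: only an increase in the error rate counts as drift.
        return self.z > self.p_hat + self.control_limit * sigma_z


def first_alarm(errors, **kwargs):
    """Return the 0-based index of the first signal, or None if there is none."""
    detector = EwmaDriftDetector(**kwargs)
    for i, e in enumerate(errors):
        if detector.update(e):
            return i
    return None
```

In use, each prediction of the underlying classifier would be compared with its true label and the resulting 0/1 error passed to `update`; the detector keeps only a handful of scalars, which is consistent with the O(1) overhead and no-storage properties described in the abstract. A fixed `control_limit` does not by itself keep the false positive rate constant as the estimated error rate changes, so that part of the method would need the calibration described in the paper.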

Similar resources

Robust economic-statistical design of the EWMA-R control charts for phase II linear profile monitoring

Control charts are powerful tools to monitor quality characteristics of services or production processes. However, in some processes, the performance of the process or product cannot be controlled by monitoring a single characteristic; instead, it needs to be controlled through a function that is usually referred to as a profile. This study suggests employing exponentially weighted moving average (EWMA) and range ...

Mixed Exponentially Weighted Moving Average-Cumulative Sum Charts for Process Monitoring

The control chart is a very popular tool of statistical process control. It is used to determine the existence of special cause variation and to remove it, so that the process may be brought into statistical control. Shewhart-type control charts are sensitive to large disturbances in the process, whereas cumulative sum (CUSUM)–type and exponentially weighted moving average (EWMA)–type control charts ...

Fuzzy exponentially weighted moving average control chart for univariate data with a real case application

Statistical process control (SPC) is an approach to evaluating whether processes are in statistical control. For this purpose, control charts are generally used. Since sample data may include uncertainties coming from measurement systems and environmental conditions, fuzzy numbers and/or linguistic variables can be used to capture these uncertainties. In this paper, one of the most popula...

The Detection of Shifts in Autocorrelated Processes with Moving Range and Exponentially-Weighted Moving Average Charts

The objective of this research is to select the appropriate control charts for detecting a shift in autocorrelated observations. The autocorrelated processes were characterized using AR(1) and IMA(1, 1) models for stationary and non-stationary processes, respectively. A process model was simulated to obtain the response, the average run length (ARL). The empirical analysis was conducted to quant...

An EWMA p Chart Based On Improved Square Root Transformation

Generally, the traditional Shewhart p chart has been developed for charting binomial data. This chart has been developed using the normal approximation, under the conditions of a low defect level and a small to moderate sample size. Real applications, however, often depart from these assumptions because of skewness in the exact distribution. In this paper, a modified Exponentially Weighted Moving Av...

Journal:
  • Pattern Recognition Letters

Volume 33  Issue

Pages  -

Publication date: 2012